Exploring European Restaurants on Tripadvisor

Final Project
Data Science 1 with R (STAT 301-1)

Author

Nikole Montero Cervantes

Published

November 29, 2023

Introduction

Having previously traversed around Europe, my fascination with the continent lies not only in its rich cultural tapestry but, more prominently, in its diverse and delectable cuisine. As a seasoned traveler, I’ve embarked on culinary adventures, navigating among European restaurants with a reliance on recommendations and reviews sourced from platforms such as TripAdvisor.

Motivated by my own experiences, I became intrigued by the complex interplay of elements that elevate a restaurant from ordinary to exceptional. The quest for exceptional dining experiences for budget-conscious travelers, without compromising on the quality of gastronomic delights, emerged as a compelling challenge. Thus, I found myself driven to explore the nuances of locating affordable restaurants that deliver outstanding food—an exploration intended to enhance travel experiences for individuals like myself.

This dataset^1 was formed through scraping TripAdvisor (TA), which I later obtained from Kaggle, a platform dedicated to data science.

Data Overview & Quality

The dataset contains a pool of 1,083,397 restaurants across European countries. There are 42 variables, among these variables, 25 are categorical and 17 are numericals.

The primary concern for this data analysis revolves around the existence of missing values in different variables. This issue has the potential to compromise the accuracy of the results, resulting in misleading interpretations. To address this, the data will undergo an intensive cleaning process to ensure that the Exploratory Data Analysis (EDA) yields meaningful insights.

Cleaning the data

In the process of cleaning the data, various essential string manipulation, functions and transformation techniques were employed using the dplyr and stringr packages in R. The dataset underwent a series of refinements to enhance its tidiness and facilitate downstream analyses. Key steps in the cleaning process include:

• Variable Renaming

• Creating and Modifying Variables

• Handling Categorical Data

• Text Processing

• List Manipulation

• Numeric Extraction

• Data Filtering and Handling Missing Values

Starting of EDA

Univariate Analysis

In order to find patterns or unusual trends, I started analyzing at each variable in the dataset.

For Restaurants

Considering there are too many observations, to facilitate my exploration process I decided to look at the 10 most common restaurants.

Figure 1: Distribution of Average Ratings

In Figure 1, it is appreciated that there are a number of restaurants for the same restaurant name. Thus, looking back at the data I realized that even though those restaurants have the same name, there are all in different cities. Taking as an example, Flunch:

Figure 2: Flunch Chain Restaurants Table

Thus, through Figure 2, I realized that those restaurants conformed a chain and that’s why there a more than one of them for those restaurants. Something particular interesting is that all the top restaurants chains are French. The restaurant chain with the highest amount of restaurants is Leon de Bruxelles.

For Average Rating

Figure 3: Distribution of Average Ratings

Figure 3 shows that the European restaurants in those 31 different countries on TripAdvisor, have a high rating, approximately between 4 to 4.8. This could suggest that the average quality offered in European restaurants is reallp.y good. This would be deeper study in the multivariate section.

For the Open Days Per Week

Figure 4: Count of Open Days per Week

In Figure 4 we see that most of the restaurants are open during the seven days of the week. That is followed by six and five days per week. That makes sense since restaurants should generally be open for five days or more in order to make profit.

However, there are some restaurants that are open for 4 days or less, which is atypical to see. The impact of this low openings amount would be explored in the multivariate section.

For Country

Figure 5: Restaurant Locations among different European countries

Figure 5 displays the number of restaurants per country. France has the highest number of restaurants in this dataset, which could potentially explain why the top 10 restaurants chain are French. Croatia and Finland are the countries with the least number of restaurants on TripAdvisor. France will be explore deeper in a later section.

For Average Price

Figure 6: Histogram-Distribution of Average Price

Figure 7: Box Plot-Distribution of Average Price

Figure 6 shows a histogram is right-skewed, with a mode around 20 to 30 euros. This could indicate that the majority of European restaurants that appear on TripAdvisor are affordable and generally do not exceed 50 euros. However, there are a some exceptions, which are seen through the outlines in Figure 7 with prices ranging from 100 euros to 500 euros.

For Price Level

Figure 8: Distribution of Price Levels

It is evident in Figure 8 that most of the restaurants are mid-range, aligning with what was observed in the average price plot above. This reinforces the idea that the food offered in the majority of the restaurants in this dataset is affordable and potentially budget-friendly.

For special Diets

Figure 9: Distribution of Special Diets

Figure 9 shows that most of the restaurants on TripAdvisor do not offer special diets in their menus. However, there is some presence of vegetarian options. There is also a possibility that, for some restaurants, it was unknown, so it was registered as if they do not offer special diets. Thus, the impact of special diets can be inaccurate, not meaningful for this EDA.

For Cuisines

Figure 10: Distribution of Special Diets

In @cuisines it is seen that most of the restaurants, more than 10000 restaurants, offer a European cuisine. This make sense, since the restaurants I am exploring are located in different European cities.

There is a moderate presence of restaurants, around 1875 ones, that work as bars too. Asian cuisine is also offered by around 1250 restaurants. African and North American cuisines have a lower presence in the menus of the European restaurants. Fusion and South American cuisine are barely offer in those restaurants. Oceania cuisine has the lowest presence in the restaurants within these database.

Multivariate Analysis

Location of the restaurants

Figure 11: Cities Interactive Scatter Plot

Through Figure 11, it is appreciated that most restaurants are located in France. This reinforces the univariate analysis that indicated France having the highest amount of restaurants in the dataset.

Food Top and Bottom Ratings

Since there are a lot observations, the plot will be complicated to read. Thus, to make the analysis more comprehensive, I decided to narrow the observations studied. Since most of the restaurants offer a cuisine in Europe, I decided to explore those restaurants to make the EDA more meaningful.

This filtered dataset will be used to explore the food, service and value ratings in this section.

Figure 12: Top 10 European Restaurants based on Food Rating

In Figure 12 it is seen that the top 20 restaurants posses a a food rating of 5 out of 5. This means that the quality of the European cuisine is not only affordable, which was drawn from out previous section analysis, but also really tasteful.

Another interesting finding is that these restaurants with the top food ratings are French, which links with the overall trend of the high performance and presence of restaurants in France.

Figure 13: Bottom 10 European Restaurants based on Food Rating

In Figure 13, it is appreciated that the most of the restaurants at the bottom, posses a low food rate of 2.0. There is a slightly higher food rate of 2.5 from a restaurant from the Flunch chain. The lowest food rate is 1.5 from Don & Donna.

Figure 14: Table of the Bottom 10 European estaurants based on Food Rating

From Figure 14, it is interesting to see that the Don & Donna restaurant located in Greece, despite a low food rating, their price level is still mark as expensive. While, the French restaurant chain like Flunch, with food rating between 2.5 and 5, their price level is usually cheap, with a price around 15 euros.

Service Top and Bottom Ratings

Figure 15: Top 10 European Restaurants based on Service Rating

Figure 15 shows that the restaurants that has the highest service rating 5 out of 5 are French.

Figure 16: Bottom 10 European Restaurants based on Service Rating

Figure 16 shows the restaurants with the lowest service rating. The most common lowest rating is 2.5, followed by 2.0. The lowest service rating belongs to Don & Donna, which also has the lowest food rating as seen previously.

The Flunch chain restaurant appears again, meaning that they do not only have the a low food rating, but also a low service rating.

Value Top and Bottom Ratings

Figure 17: Top 10 European Restaurants based on Value Rating

Figure 17 shows that the restaurants that has the highest value rating, 5 out of 5, are French.

Figure 18: Bottom 10 European Restaurants based on Value Rating

Figure 18 shows that the restaurants with the lowest value rating is mostly 2. Something particular from this plot is that the Don & Donna restaurant appears again at the bottom.

Price and Rating Relation

Figure 19: Price vs Average Rating

In Figure 19, it is evident the relationship between the average rating and price are not directly proportional. This is because not because the restaurant is expensive, it has a high rating. For example, a restaurants with menu within the price range of 390 euros have a rating of 4.5, while another restaurant with a menu around 50 euros have a higher rating of 5. Thus, it is inferred that other factors, such as experience, quality and not only the price matter to the consumers when rating restaurants.

Figure 20: Most Expensive Restaurants

This logic is also reinforced when looking at the most expensive restaurants in Figure 20. Particularly, checking the Brasserie Og Restaurant NO76, a Denmark restaurant, is the most expensive restaurant in this dataset, yet has an average rating of 4.5. The average rating does not reflect a poor image of the restaurant, but it could contribute to why the restaurants is more expensive than others.

Nevertheless, when checking Au Bon Accueil, a French restaurant with an average rating of below 3.75 and average price around 275 euros. It is questionable how a restaurant can charge such a substantial price in the presence of a less than meritorious rating.

Hence, it can be concluded that while price exerts influence on the restaurant’s average rating, and viceversa, there exist additional factors—namely, the ambience, food quality, and service—that significantly shape the overall experience for each patron, thereby influencing the performance of the restaurant at large.

Exploring particular restaurants

Don & Donna

As explored before, Don & Donna restaurant has appeared at the bottom in the food, service and value rating:

restaurant_name food_rating service_rating value_rating avg_price price_level
Don & Donna 1.5 1.5 1 57.5 expensive
Figure 21: Table of Don & Donna Restaurants Chain Information

Thus, from Figure 21 it is possible to infer that the restaurant, Don & Donna is the worst one in this dataset base on the food, service and value rating. Still, it is interesting to see that even though their rating is bad, their prices are still expensive around 50 euros. This lead to think that maybe Greek restaurants are usually expensive regardless of their rating.

The Flunch

The Flush, the French chain, has appeared with the highest amount of restaurants, yet it posses the lowest food and service rating as seen previously in the EDA.

Figure 22: Table of The Flunch Restaurants Chain Information

Figure 23: Table of French Restaurants with the lowest food, service and value rating

Comparing Figure 22 and Figure 23, it is clear that the Flunch chain restaurant is not the worst restaurant in France, since that title goes to Les Chandelles and La Confiance.

Nevertheless, considering other French restaurants around the same price average an price level:

Figure 24: Table of French Restaurants with an average price similar to The Flunch

In Figure 24, it is showed that costumers can find places where they get good food without spending more, and still get great service. Restaurants like La Table de Charbon-Blanc and Restaurant Plus Belle La Vie prove that French restaurants can be easy on the wallet while giving you a great experience.

Exploring France a bit deeper

Along this EDA, France had a strong presence since most of the restaurants are located there. Also, when looking at prices and ratings the top and bottom restaurants, French restaurants appeared. Thus, I decided to particularly explore at restaurants located in France.

Price level and Ratings

Figure 25: Distribution of French Restaurants by Price Level

Figure 26: Table of French Restaurants with the highest average rating and their price level

Through Figure 25, it is evident that the majority of French restaurants fall into the mid-range category. This suggests that French restaurants offer a variety of services and cuisines that are affordable for consumers. Moreover, their service remains excellent, as indicated in Figure 26, which shows that some top-rated restaurants with an average rating of 5 also belong to the mid-range category.

Additionally, Figure 25 suggests that French restaurants cater to diverse customer budgets. There are upscale, expensive restaurants for special occasions, as well as mid-range and affordable options for casual or informal gatherings. Regardless of the price level customers are seeking, they can still find restaurants with great food, value, and service, as illustrated in Figure 26, listing restaurants from cheap to expensive, all with an average rating of 5.

How many French restaurants have an award?

Figure 27: Distribution of French Restaurants with Awards

In Figure 27, it is evident that the majority of French restaurants have received awards. This underscores the excellent culinary service that French restaurants offer. These accolades not only enhance their reputation but also contribute to the higher costs associated with some French restaurants. Winning awards can impact not only the customer experience but also the pricing of their menus.

Conclusion

In conclusion, the comprehensive analysis of the European dining landscape revealed several noteworthy insights. One of the most striking findings was France’s culinary dominance, evident in the sheer number and top rankings of its restaurants. The unexpected concentration of restaurants in France prompted further exploration into the intricate dynamics of the culinary scene on the continent.

Another finding is a widespread perception of excellence in food and service across the continent reflected in the high average ratings of European restaurants on TripAdvisor. Indeed, the top-rated European restaurants, in terms of food, are predominantly located in France.

Interestingly, exploring restaurant chains like The Flunch challenged the assumption that more locations correlate with higher ratings. This idea suggests that success in the restaurant industry goes beyond mere quantity, which was expected since, as a consumer, price, and food quality are important factors but not the only ones to consider when assessing my dining experience at a restaurant.

The interplay between price and ratings added another layer of complexity to consumer preferences. Contrary to the assumed proportional relationship, some expensive restaurants exhibited lower ratings than their more affordable counterparts. The case study of Don & Donna exemplified this phenomenon, emphasizing the idea that perceived value extends beyond the monetary aspect and that customers consider various factors in their assessments.

I was surprised by the diversity in French restaurants’ price levels, spanning from budget-friendly to upscale, all with consistently high or above-average ratings. Besides, as expected, I found that most French restaurants had won an award because of their strong presence and high food ratings. Hence, France stands out as a culinary powerhouse.

Future research could explore the impact of low open days per week on restaurant performance and delve into the correlation between specific cuisines and higher ratings. Additionally, it would be intriguing to analyze whether award-winning restaurants experience higher customer traffic, profitability, or industry recognition. These investigations could provide valuable insights and a better understanding of the European restaurant industry.